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Abstract. We develop a "weak Wazewski principle" for discrete and contin- 
uous time dynamical systems on metric spaces having a weaker topology to 
show that attractors can be continued in a weak sense. After showing that the 
Wasserstein space of a proper metric space is weakly proper we give a sufficient 
and necessary condition such that a continuous map (or semiflow) induces a 
continuous map (or semiflow) on the Wasserstein space. In particular, if these 
conditions hold then the global attractor, viewed as invariant measures, can 
be continued under Markov-type random perturbations which are sufficiently 
small w.r.t. the Wasserstein distance, e.g. any small bounded Markov-type 
noise and Gaussian noise with small variance will satisfy the assumption. 



In this paper we are going to show that the invariant measures of a dynamical 
system having a global attractor (either discrete and continuous time) can be "con- 
tinued" under small (not necessarily bounded) noise. Instead of just showing that 
there is a stationary measure of the perturbed system "weakly" close to the original 
ones (see e.g. |Kif881 1.7]) we show that it is close w.r.t. the Wasserstein metric w p 
(the order p depends on regularity of the noise) . Previous research mainly focused 
on Gaussian type noise, "absolute continuous" noise or assumed implicitly bounded 
noise, e.g. Kifer [Kif88, p. 103] and L.-S. Young [You86j considered noise on a posi- 
tive invariant (bounded) neighborhood U of a local attractor which is zero on dU 
and thus is bounded. If X is compact than any noise will be bounded. Thus we 
are in particular interested in non-compact X, although we restrict our attention 
to proper metric spaces which includes all locally compact geodesic spaces. 

All our results apply equally to discrete and continuous time dynamical system. 
We will mainly focus on discrete time because there is a better intuition behind 
these. In the first section we extend Rybakowski's continuation of a positive in- 
variant isolating neighborhood to the discrete time setting which will be the key 
step to treat continuous and discrete time systems on equal footing. Using a kind 
of "weak Wazewski principle" we show that attractors can be continued in a weak 
sense without assuming admissibility of the perturbed system (theorem [3]) . 

Then we introduce the Wasserstein space and show that the Wasserstein space of 
a proper metric space is weakly proper, i.e. closed ^-neighborhoods of compact sets 
are weakly compact (theorem [6]). We give a necessary and sufficient condition such 
that a dynamical system / (resp. a semiflow tt) on a proper metric space makes 
the transfer map /* : V(X) — » V(X) (resp. transfer semiflow 7r*), which is always 
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(weakly) continuous, (strongly) continuous on any Wasserstein space (P p (X),w p ) 
of order 1 < p < oc. 

Finally, we look at Markov-type perturbations of a dynamical system / (resp. 
semiflow ir) having a global attractor. These are perturbation of /* in the Wasser- 
stein space. Under quite general assumptions on / we can show using Conley 
theory that if the perturbation is sufficiently small then the perturbed system has 
an isolated (weak) attractor and there is at least one stationary measure (theorem 
IT5]) . The perturbed attractor is strongly close to the unperturbed attractor and 
contains all invariant measures and its positive invariant isolating neighborhood is 
convex. If the noise is of order p then the mass of the invariant measures decays 
at least as R~ p where R is the distance from the global attractor of / (resp. 7r). 
Furthermore, a standard result from Conley theory shows that if the noise level 
converges to zero then the set of invariant measures converges in the Wasserstein 
space to the set of invariant measures of the deterministic system (as usual without 
further assumptions only upper semicontinuity holds). 

Because of the special form of the Kantorovich- Rubinstein duality in V\(X) we 
can show that the (local) weak attractor of the perturbed system P : V\(X) — > 
V\ (X) is actually the global weak attractor of P. 

The framework of a metric space with a weaker topology used here is similar to 
the framework in |AGS08[ section 2.1] used to construct general gradient flows. 

Motivation. Consider a dynamical system / : X — > X on a proper metric space 
X having a global attractor, i.e. a compact invariant set A that attracts all its 
(bounded) neighborhoods. We will not consider the map / itself, but the map 
/* : V\{X) — > Vi(X) defined via push- forward map on the space of probability 
measures. If / is "nice" then /* is continuous and has the global attractor K — ViA), 
i.e. the probability measures supported on the global attractor of /. 

Markov-type noise can be considered as a perturbation F of /*, i.e. instead of 
fix ^ 3f(x) we have S x i-> p(dy\x) where p(dy\x) and 8f( x ) uniformly close w.r.t. 
the Wasserstein distance w\ for all x. This can be seen as a smearing of the image 
f{x) or some uncertainty about the actual image. For example, if / is the time-1 
map of a flow generated by the ODE x = g(x) then F could be the time-one map 
of the flow of distributions of the SDE dx = g(x)dt + edWt, i.e. additive Gaussian 
noise with small variance. 

If F and /* are sufficiently close then F has a (weak) global attractor (in V\(X)) 
which is close to K w.r.t. wi. Hence stability of the global attractor holds in the 
Wasserstein space Vi(X). 

The following example is inspired by Crauel, Flandoli - "Additive Noise Destroys 
a Pitchfork Bifurcation" |CF98| and could be decribed as "Additive Noise Destroys 
Attractors". The noise will be worse than white noise used by Crauel and Flandoli, 
but can still be considered as small. 

Example (Generic collapse under "small" noise). (1) Suppose / : X — > X has a 
global attractor and at least one fixed point xq (the argument works equally well 
with general attractors). Take any noise level e > and let P e : V(X) — > V(X) be 
the Markov map induced by 



x H- (1 - e)<5/( x ) + e<W 
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This map is (weakly) close to the unperturbed system /* : V(X) — s> V(X). Namely, 
if <1lp is the Levy-Prokhorov distance (which metrizes V(X)) then 

sup d LP (P e (iJ,), /*(/•«)) < e. 
peP(X) 

But P has exactly one invariant measure, namely S XQ1 and all others converge to 
this measure weakly. 

(2) Now we want to show that this can also happen in any Wasserstein space 
V P (X) for 1 < p < oo (Voo(X) only allows bounded noise which when sufficiently 
small cannot destroy local attractors and thus a global attractors with at least two 
sinks never collapses, see [Kelllj ). Suppose a : X [0, 1] is a continuous function. 
Define q a (dy\-) : X ->• V P (X) by 

x H> q a {dy\x) = (1 - a(x))S f(x) + a(x)5 X0 , 

which is obviously continuous. Thus 

w P (q a (dy\x), &f{ x )) p = a(x)d(f(x), x Q ) p . 

So if we define 

e p 

a{x) = l + d(f(x),x Q )P 

then 

( (a i ^ x \p eP ' d (f( x )> x o) p s P 

Wp(q a {dy\x),5 s(x) y = < e p . 

1 + d{f{x),x )P 

So, in particular, the induced MW-map Q a : V P (X) — > V P (X) of order p relative 
to / has noise level e. Furthermore, the only invariant measure of Q a is 6 Xo and all 
other measures converge to it. 

The example above should make clear that using arbitrary unbounded noise even 
when it is small can have strange effects on the global attractor. Although we have 
some "attracting" invariant measures of the perturbed system the attractor might 
look very different from the original one, in our case it might be just one fixed 
point and this one can even be the "most" unstable one of the original attractor. 
Therefore, stochastic stability of attractors under arbitrary "small" noise should not 
be referred to a single invariant measure but to all of them, even though we can 
speak of stochastic stability if the type of noise is more restricted, besides of being 
sufficiently "small". 

1. DISCRETE-TIME CONLEY THEORY FOR STABLE INVARIANT SETS 

In this section we will use Conley theory, that is continuation methods from 
Conley index theory without using the topological (or (co)homological) Conley 
index. We will prove a continuation for a positive invariant neighborhood of a 
stable isolated invariant set of a time discrete dynamical system. The result will 
not require a compactness assumption (called admissibility) of the perturbed system 
and is a different type of continuation than [MR9I]. Our proof will follow the proof 
of |Ryb87, Theorem 12.3] which is the continuation for semiflows. In particular, 
the results stated here and in the next sections also hold for semiflows if we assume 
that they do not explode on a given neighborhood. 

In both cases the Wazewski principle for the index pair of the perturbed system 
does not apply. But we can use other assumptions to show that attractors continue, 
e.g. the map is weakly continuous and closed (5-neighborhoods of compact set are 
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weakly compact, which is the case for the Wasserstein space on proper metric 
spaces. 

We will now give the definitions used in |MR91) and [Ryb87| to prove the ex- 
istence of an index pair for certain isolated invariant sets. Our setting will be a 
complete separable metric space Y and a dynamical system, i.e. a continuous map 
/ : Y — >• Y. A full left solution of / in N is a sequence {x- n } n ^ C N such that 
f(x n -i) = x n for n < 0. Define the following sets 

A+(N) = {xeN\ f k (x) £ N for all k > 0} 

A~(N) = {x S N\ 3 full left solution {x- n }n£Z in N through xo — x} 
A(N) = A+(N)nA-(N). 

These are called the maximal positive invariant (resp. negative invariant, resp. 
invariant) set in N. If N is unbounded then A(N) usually denotes only the bounded 
invariant orbits instead of all of them. A set K is called invariant if A{K ) = K. If 
there is a closed neighborhood N of an invariant set K with A(N) — K then K is 
called isolated with isolating neighborhood N. 
For (,m£N and I < k define 

fVM( x ) = {y\f k ( x ) = y for some k £Nn[!,m]} 

If f n : Y — y Y is a sequence of continuous maps such that /„ — > f, i.e. f n (x n ) — > 
f{x) whenever x n — > x as n — > oo, then we say that a closed bounded set N is 

{fn}~ admissible if for any sequence {x„} n£ N with /n°' m "'(a;n) C and m n oo 
the sequence of endpoints {/™™ (a; Tt )}„ e N is precompact. In case this property holds 
for f n = f then we just say N is /-admissible. 

Remark. Later on, we deal with dynamical systems on the space of probability 
measures Y = V{X) for some metric space X. An invariant measure for that 
system is invariant w.r.t. the definition above. In particular, periodic measures 
will be called invariant. A fixed point for these systems will be called stationary 
measure. 

In the following we will use several ideas from |MR91| : Let N, N' be two /- 
admissible isolating neighborhood for some isolated invariant set K with 

N c intiV'n/^intiV'). 

The authors in [MR911 4.4] used a so called Lyapunov pair (0, 7) which is continuous 
on a small neighborhood W C N of K and has the following properties: K C 
7 _1 (0), <f> (resp. 7) is decreasing (resp. increasing) along orbits and <j>(x) = with 
x G W implies 

x e A-{N)l)dN'. 

Because K is compact we can choose d(W, dN') > and assume x G A~(N) 
whenever <p(x) = 0. Furthermore, it is shown that if 4>{ x n) then x n admits a 
convergent subsequence. 

Theorem 1. Suppose N is an isolating neighborhood for K such that the assump- 
tions above hold and 

A~(N) = A(N) = K ^ 0. 

Then there exists an admissible isolating neighborhood B C N which is positive 
invariant, i.e. no trajectories exit B. 
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Remark. This is the discrete time version of |Ryb87[ 1-5.5] using the theory of 
[MR91] . The proof is essentially copied from Rybakowski using the Lyapunov pair 
above. 

Proof. Define 

P{ = N n cl{a; e int N' \ <j>{x) < e} 

Pi = Pi\{% G int N' | 7(a) < e}. 

it was shown |MR91| 4.4] that Pf C is a neighborhood of K for sufficiently small 

e > and whenever a; G P/ and /(x) £ N then a; £ Pf and if a; € Pf and f(x) £ N 

then sc e P|, i.e. P| is the exit ramp for Pf. 

Now fix a sufficiently small e > and let < 5 < e then P* c Pf. Define 

P2 := Pf nP| then because <p is decreasing along orbits P| is still an exit ramp for 
P s 
2 ' 

If A" (iV) = A(N) = K then we claim that there is a 6 > such that P| = 
which implies that Pf 7 is positive invariant. If this does not hold then there is a 
sequence x n £ P 1 "nP| with S n — > 0. Thus (j)(x n ) and 7(x n ) > e which implies 
that there is a subsequence x„< x E N such that </>(x) = 0. Hence 

x e A~(N) = K 

and 7(x) = by assumption. But 7 is continuous and x n > — > a; implies e < 7(x n ') — >• 
7(2;) = which is a contradiction. This proofs our claim and thus the theorem. □ 

In the following we will assume that K satisfies the assumption of the theorem 
and that B :— pf°, <j> and 7 are given as in the proof. It is obvious that Pf* is 
positive invariant w.r.t. / for any < 5 < 5q. Furthermore, suppose /„ — > /. 

Theorem 2. Assume N' (see above) is {f nm }-admissible for each subsequence of 
{/nJngN- Set U — int B and define 

V(a) = {xe U\4>(x) < a}. 

Then for some ao > ; N := ciy(ao) C U. Furthermore, for some sufficiently 
small €q > and all < e < eo there is an no = Tio(e) such that for all n > uq 
there is a positive f n -invariant closed N n (e) and 

K» cV{e)c N n (e) C N. 

Remark. The complete continuation theorem for index pairs does not hold for dis- 
crete time dynamical systems in general. A proof would require that there is a 
neighborhood such that the exit time is continuous in U , i.e. -4- uj + (xo) 

whenever x n — > xq in U, which holds for semifiows only for so called isolating 
blocks. These blocks do not necessarily exist for continuous maps. 

Proof. By |MR91[ 3.9] there is an a Q > such that N = c\V(a ) C U. And similar 
to |Ryb87[ 1-4.5] we can show that for < e < ao and all n > no(e) 

K n C V(e). 

Define 

JV n (e) = N n cl{y \ 3x G V(e), m > s.t. f [ n M {x) C £7 and f™(x) - y}. 

Following the proof of |Ryb87[ 1-12.5] we can show that N n (e) satisfies the following 
properties for n > no(e) 
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• x G N n (e) and f n (x) G N implies f n (x) G N n (e) 

• K n C V(e) C N n (e) 

We claim that for small eo > whenever e < eo and n > no(e) then N n (e) is positive 
invariant w.r.t. /„. If this is not true then there is a sequence e m — > and 

Vm € A„ m (e m ) 

with fn m {Vm) £ N. By definition of N nm (e m ) there is a sequence y rn G Y", 
im G V(e m ) and fc m > such that d(y m ,y m ) < 2~ m , fn m km] {x m ) C {/ and y m = 
fn™(x m ). Because <p(x m ) — > and Aj(B) = Af(B) we can assume w.l.o.g. that 
x m — > xq G Af(B). Admissibility and f nm — > f imply the sequence {/^™(x m )} m6 N 
has a convergent subsequence and w.l.o.g. y m — fn™(x m ) — > ?yo G Aj(N') — 
Af(N') G int N and thus y m — > j/o- 

Since f„ m (y m ) f N G int N' n /-^intiV') and /„ -> / we have f„ m (y m ) ->• 
/(j/o) G N'\mtN. But yo G A f (N') implies /(y ) G Af(N') which is a contradic- 
tion because Af(N') and AT'\int are disjoint. □ 

Corollary 3. Under the assumption above for all n > no we can find positive 
j n -invariant N n ,N n such that 

N n C Us(K) G N n C B 
for some 8 -neighborhood of K denoted by 

Us{K) = {x G Y | d(x, y) < 8 for somey G K}. 
Furthermore, we have K C int N n and there is an e > such that 

U e (K n ) C N' n . 
Proof. Applying the previous theorem we get 

K U K n C V(e) CN n cB. 
Recalling the definition of it is obvious that because Aj(_B) = Af(B) = K for 
small < e' < 1 and .t G Pf' 

d(x,K)<e'. 

Because K is compact and V(e) a neighborhood of K the <5- neighborhood Us{K) of 
AT is contained in V(e) for 8 sufficiently small. Furthermore, we can find an e' > 
with e' < 8 such that Pf C Us{K) C V^(e). Applying the theorem again for Pf 
instead of B we get for n > n 

N n c P^' C C^(A-) C < G B 

and X G V(e') C JV„. 

To show that U € (K n ) G A„ we need another positive /„-invariant neighborhood 
N n . First note that there is a 8' > such that 

d(x, K) > 8' 

for all x G dP e . So if we choose < 2e < 8 then 

Applying the previous theorem again we get a positive /„-invariant isolating N n of 
K n inside of Pf . Hence 

c/ e (A-„) g u £ (K) c c/ e (pr) c p( c a;. 



STABILITY OF THE GLOBAL ATTRACTOR UNDER MARKOV- WASSERSTEIN NOISE 7 

□ 

Now we are able to continue the attractor. Instead of an admissibility assumption 
for the perturbed map we will use weak compactness of close (^-neighborhoods of 
compact sets. 

Definition 4 (weak attractor). Suppose Y has a weaker (Hausdorff) topology 
(i.e. x n — > x strongly implies x n — v x weakly) and / is continuous and weakly 
continuous. An isolated invariant set K is called a weak attractor if it admits 
a positive /-invariant isolating neighborhood N such that u; wcak (A) C K where 
w wcak (JV) is defined as 

W ™*(JV) = {yeY\ 3x n e N,m n -»• oos.t. f m "(x n ) - y}. 

Remark. (1) Our definition of weakness of an attractor is w.r.t. the weaker topology 
and is different from one defined in [Hur Olj. Even our definition of a (strong) 
attractor is weaker than the one used there because we only require the existence 
of a positive invariant isolating neighborhood of the invariant set. But there might 
be a connection to Ochs' weak random attractor [Och99 . 

(2) A Conley theory with weak-admissibility instead of admissibility might not 
make sense since the continuation proof requires continuity of the metric and usually 
the metric is only lower semicontinuous w.r.t. weak convergence. 

(3) A weakly continuous function might not be continuous and vice versa (see 
counterexample in the proof of theorem [8]) 

Theorem 5. Under the assumption of the previous theorem, suppose there is a 
weaker (Hausdorff) topology on Y and that (strongly) closed 6 -neighborhoods of 
compact sets are weakly (sequentially) compact, i.e. c\Ug(C) is weakly compact for 
compact C . If f n is weakly continuous then K n is non-empty and a weakly compact 
weak attractor w.r.t. f n for all n > uq. Furthermore, U e (K n ) C N n for some e > 
and positive f n -invariant N n . 

Proof. Applying the previous corollary we get 

N n c U S (K) C N n C B 

and 

U e (K n ) C N' n . 

Because /„ is weakly continuous, N n is positive /„-invariant and clUs(K) C N n is 
closed and thus weakly compact the set 

< eak (x) = {y £ Y | f^(x) - yfor somen fc -> oo} 

for x £ N n is non-empty and weakly compact. This implies K n ^ and in 
particular 

< eak (a;) Cf„CiV„. 

Similarly weak compactness of c\Us(K) implies w™ eak (A^„) C A n {N n ) = K n C N n 
and thus K n is a weak attractor. Obviously K n is weakly closed and contained in 
the weakly compact set clUs(K) and is therefore weakly compact as well. □ 
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2. WASSERSTEIN SPACES 



Now we will introduce some notation and results for Wasserstein spaces of a 
metric space, general references are |AGS08| and [Vil09 . 

Let (X, d) be a complete separable metric space, also called Polish space. We 
call it proper if every bounded closed set is compact. In particular, this implies 
that X is locally compact. The metric of a non-compact proper metric space is 
necessarily unbounded. 

The space of probability measures on the Borel cr-algebra of X is denoted by 
V(X). This space is given the weak topology, i.e. fj, n — fx if J fdfi n — > J fd\i for 
all bounded continuous functions / . Let xq be an arbitrary point of X and define 
V P (X), the Wasserstein space (of order p), by 



where it G IL(n, v) C V{X x X) with 7r(A x X) = /x and ir(X x A) = v for all 
Borel sets A and B. Then (V p (X),w p ) is a complete separable metric space. This 
topology is usually stronger than the induced subspace topology of V P (X) C V(X). 

If X is compact so is V P (X). And V P (X) is local compact only if X is compact. 
A counterexample for non-proper metric spaces is given in [AGS08, 7.1.9]. We will 
adjust their example to non-compact proper metric spaces by showing that the 
closed e-ball B e p (S Xa ) around S X() in V P (X) cannot be compact for any e > and 
thus V P {X) cannot be locally compact. 

Example. Assume X is non-compact and proper and define 

Mn = m n S Xn + (1 - m n )5 X0 

for some sequence {n n } ie H C X . Then 

w p (^ n ,S XQ ) p = m n d(x ni x ) p . 

Suppose d(x n ,x ) > e > and set m n = e p ■ d(x n ,x )~ p then fj, n G dB™ p (5 Xo ). 
If {rn n }„gN stays bounded away from then {d(x ni xo)}neN is bounded and thus 
{x n } n< zfi and {m„}„ 6 N have convergent subsequences x n i — > Xoo and m n > — > m G 
(0, 1] and fj, n — > mS Xac +(1— m)8 Xo strongly in V P (X). But if we assume d(x n , xq) —> 
oo then m n — > and thus fj, n S Xo weakly. Because strong convergence requires 
that w p {ji n , 8 XQ ) — e converges to the sequence cannot converge strongly in V P (X). 

Even though Wasserstein spaces are in general not locally compact we can still 
show that the following holds for proper metric spaces. The result is probably 
known or at least implicitly used in case X = M™. Because it will be our main 
reason why the "weak" Conley theory is applicable and because we couldn't find 
any reference, we will prove it completely. 

Theorem 6. If (X, d) is a proper metric space then all closed S -neighborhoods of 
compact sets in (Vi(X),wi) are weakly compact, where the weak topology ofVi(X) 
is the induced subspace topology V\{X) C V(X). A space, e.g. Vi(X), having this 
property may be called weakly proper. 



V p {X) = {n£ V{X) | / d(x ,x) p d[i(x) < oo}. 



Furthermore, define for /!, v G V P {X) 




inf 

7ren(;i,!/) 
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Corollary 7. For 1 < p < q closed S -neighborhoods of compact sets in (P q (X),w q ) 
are compact in (V p (X),w p ), i.e. V q {X) is weakly proper w.r.t. the subspace topology 
induced by V q (X) C V P (X). 

Remark. This is stronger then a compact embedding i : (P q (X), w q ) — > (V p (X),w p ) 
because if yu„ G V q (X) is bounded then w.l.o.g. i([i n ) — > yU* in V P {X) and neces- 
sarily [i* G V q (X), i.e. bounded sequences never "leave" the space. 

Proof of theorem. We will show that 

B? :={veV^X)\w^v,S X0 ) <r} 

is weakly compact for all r > 0. Since B™ is closed and [i n — 1 fi implies w\(fi n ,6 Xo ) < 
liminfn^oo w\{ii n ,8 Xo ) < r we only need to show that B™ is tight. 

Tightness of a subset K. C V\{X) means for all e > there is a compact K e such 
that for all \x G K, 

l*(X\K e ) < e. 

For n G B™ we have 

/ d(x,x )d/j, = wi(fj,,5 Xo ) < r. 
J x 

Now choose K e = Br. (x ), the closed ball around xq with radius |, which is compact 
because X is proper. Then we have 

H(X\K e ) = / d/j,(x) < - / d(x 7 xo)d/i(x) < e. 
Jx\K e r J X\K € 

Thus the closed ball in 'Pi(X) around <5;r is weakly compact. 

Let K C V\{X) be a compact set , e.g. K — {/x}, then the closed i?-neighborhood 
around K is defined as 

N r( K ) = i v G PiWhO*,") < -R for some £ X}. 
This set is closed and bounded and for some R we have 

N%{K) C B%. 

Let f„ G Nft(K) be an arbitrary sequence. Then there are /j, n G K with 
< -R. Because B^ is weakly compact and K is compact there are 
^oo S and /^oo G if such that for some subsequence (also denoted by fi n , resp. 

Since u>i(-, •) is weakly lower semicontinuity we have 

wi(fi, v) < liminf u>i(^„, u n ) < R, 

n—>oo 

i.e. v G Nft(K) which implies weak compactness. □ 

Proof of corollary. We only show that B™ q (5 X0 ) is weakly compact w.r.t. the in- 
duced subspace topology V q {X) C V P (X) for 1 < p < q. The rest will follow by 
the same arguments used above. 

Assume {/J, n }neN C B r q (8 Xo ). Since w q < u>i the previous theorem implies 
w.l.o.g. fj, n — »■ /L^oo for some ^ G V\{X). Because 

Wg(Atoo,4 ) < liminf w q {n n ,5 Xo ) < r 
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we actually have fioo g Br 9 (S Xo ) C V q (X). 
Because 1 < p < q 

If r 

d(x,x ) p dfi n (x) < — — / d(x,x ) q did n {x) < — — 

x\b r M p Jx\b r uq 



Hence 



f r 
lim limsup / d(x,Xo) p d/j, n (x) < lim — — = 0. 

fl^oo n ^oo Jx\B n R-HX Rl P 



>X\B R 



This and the fx n weakly show that //„ —> /ioo in V P (X) (see [Vil09, 6.8]). □ 

In the following assume that X is proper. Suppose now / : X — > X is a continu- 
ous map having a global (set) attractor, i.e. there is a compact /-invariant A <Z X 
such that for all bounded sets B 

lim d(f n (B),A)=0, 

n—toc 

where d is the semi-Hausdorff metric induced by d such that d(A, B) = iff A C 
cl£>. The map / induces a continuous map /* : V{X) — > V(X) with /*(/i)(B) = 
/i(/ _1 (i?)) for all Borel set B. Furthermore, under slightly stronger assumptions 
/* has a global attractor 

V(A) = {neV(X)\n(A) = l}. 

Remark. For compact X the global attractor is always X itself. In particular, since 
V P {X) = V(X) for 1 < p < oo is compact the global attractor of V(X) is the space 
itself and we don't get new information. The whole theory is only interesting for 
non-compact proper metric spaces X. 

Since the Wasserstein space includes distance the behavior of / at infinity be- 
comes important. 

Theorem 8. The map /» induces a continuous map (V p {X) 1 w p ) — > (V p (X),w p ) 
(also denoted by f*) if and only if for some xq G X 

d(f(x),x ) 

SUp -. r < OO. 

xeX 1 + d(x,xo) 

Remark. For a semiflow it we need that sup^g^ d ( x ^ t - x o)/i+d(x,x ) — Mt < oo for 
t € [0,7b]. Which means, in particular, that there has to be a global lower bound 
on the blow-up time and thus there cannot be blow-ups at all, i.e. tt has to be a 
global semiflow. Which implies that the induced semiflow 7T* on V P (X) is also a 
global semiflow. A necessary requirement for the existence of a global attractor is 
an upper bound on M t for all t > 0. The requirements in [AGS08 ( Chapter 8] are 
sometimes too strong. A sufficient condition is that a one-sided Lipschitz condition 
holds globally (e.g. v{x) — x — x 3 is an unbounded vector field and only locally 
Lipschitz, but satisfies a one-sided Lipschitz condition). 



Proof. Suppose first that 



d(f(x),x ) 

M = SUp -. r < CO. 

xex 1 + d{x,x ) 
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We can assume w.l.o.g. M > 0, otherwise f\x = xq and the result is obvious. For 
fj, € V P (X) we have 



d(x,x ) p df*n(x) = J d(f(x),x Q ) p dfi(x) 

< M p [{1+ d{x, x )) p dfi{x) < oo, 



i.e. G V P {X). So we only need to show continuity. 

Suppose )i n — > n in V P {X) then /*/i n — ^ Since d(f(-),xo) p is continuous 

and grows at most like d(-,xo) p it follows that 



d(x,x ) p df*fi n (x) = I d(f(x),x ) p dfi n (x) 

d(f(x),xo) p d)jL(x)= I d(x,x ) p df r fi(x). 



Thus -> (see |Vil09| 6.8]) which shows that /» : ("P P (X), w p ) -> ("P P (X), w p ) 
is (strongly) continuous. 

It remains to show that /» is not continuous if there is a sequence {a; n }„gN C X 
such that 

, d(f(x n ),x ) 
1 + d(a;„,xo) 

Because X is proper and / continuous we must have d{x n ,X{)) — > oo. For large n 
we can assume < -j- < d(x n ,xo). Set c n = -j- then 

and w p (fi n , 5 Xo ) p = c£ -> 0, i.e. strongly in V P {X). We have 

/*^„ = c p — — - — — S f(Xn) + (1 - c£ — — rr)Sf( Xo ) 

d(x n ,x ) p d(x n ,x )P 

and therefore 

w (f^nJJx Y = (f &^h^L = 1 + d{x n ,x y ^ l 

P * * X ° n d(x n ,x ) p d(x n ,x ) p 
which implies that /* cannot be (strongly) continuous in 5 XQ . □ 

The following results will hold for any V P (X). To simplify the notation and some 
of the proofs we will just state them for V\{X). Furthermore, we assume from now 
on that /* : Vi(X) —> Vi(X) is strongly continuous (for short just continuous) 
and whenever we speak about /* we mean the map /* : Vi(X) — > V\{X). Since 
f*(Vi(X)) C Vi(X) (for /, : V(X) -> V(X)) this also implies that /» is weakly 
continuous in Vi(X). Similarly we could say that /» : V q {X) — > V q (X) is continuous 
and "weakly" continuous in V q (X) w.r.t. the induced subspace topology of V q (X) C 
V P (X) for any 1 < p < q. 

Example. Having a global attractor does not imply that /* is strongly continuous, 
even finite time compactness is not sufficient: Let X be M with the Euclidean metric 
| • |. Define / : R -> M by 

JO a;>0 
x z x < 0. 



STABILITY OF THE GLOBAL ATTRACTOR UNDER MARKOV- WASSERSTEIN NOISE 12 



Then / is continuous and f 2 = but for x n = 

d(f(x n ),0) n 2 



— > oo, 



l + d{x n ,0) l + n 
i.e. /* is not continuous on (P p (X),w p ). 

If K C V\{X) is invariant w.r.t. /* then it is invariant w.r.t. /* : P(X) — > V(X). 
Which implies that all measures in K are supported on the global attractor A of 
/. Since P(A) C Pi(X) the maximal invariant set of Pi(X) is P(A) = Pi{A). 

Suppose / is finite time compact, i.e. there is an m such that f m (X) C B R for 
some compact set B R . It should be obvious that this implies K = V\ (A) is the 
global attractor of /*. Furthermore, we have the following: 

Proposition 9. Suppose for some m > 0, 

f m (X) c B R (x ). 

Then /* is finite time compact and thus any closed set B w C Pi(X) is /* -admissible. 

Proof. Let {v n }neN be any sequence in f™(Pi(X)). Then there is a sequence 
{^n}nGN such that v n = f™{n n ). f m {X) C B R (x ) implies suppj/„ C B R (x ). 
Thus i/„ is tight and 



/ 

Jd( 



d(x, x$)dv n {x) = 0, 

/ d(x,xo)>R+e 

i.e. {v n }neN has uniformly integrable first moments. Which means that {v n }nen 
has a convergent subsequence. Therefore, f™(Pi(X)) is compact which easily im- 
plies admissibility for any closed B w . □ 

Proposition 10. Suppose there is an R n , < c < 1 and m > such that for all 
R > Rq 

f m (B R ) c B cR . 
Then any bounded closed set B w C P\(X) is /*- admissible. 

Proof. Let {/i rl } rie N be a sequence in such that /i°' m "' C -B™ for some 
m„ — > oo. Because i?" 1 is bounded we have 

/ d(x, xo)d(j, n (x) < M. 
J x 

First assume m n — k n ■ m for an unbounded sequence k n <E N. Then for R> R 

J XX\B R (X) ■ d(x,X )df™^ n (x) = J XX\B R {f mn {x)) ■ d(f m " (X), X )dVn(x) 

< J Xx\B c _ knR (x) ■ c kn d(x,xo)d^n(x) 

< c fe "M^0, 

which shows that {f™"(Hn)}n£f$ has uniformly integrable first moments which im- 
plies that the sequence has a convergent subsequence. 

If m n ^ (mod to) then for some < l n < m we have m„ — l n = O(modm). 
Therefore, if we set v n = f \ n (/x) then the argument above applies to v n and the 
sequence of endpoints (which is equal to {/™"(Mn)}nGw) has a convergent subse- 
quence. □ 
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Remark. Kifer used in [Kif88| Theorem 1.7] linear attraction instead of exponential. 
This might not be sufficient for admissibility. Nevertheless, later we will assume 
that a Markov-type perturbation of /* is small in the Wasserstein distance which is 
stronger than Kifer's assumption and thus an invariant (probability) measure exists 
for the perturbation by the same theorem. But that theorem does not imply that 
the perturbed and unperturbed invariant measures are close w.r.t. the Wasserstein 
distance, the perturbed invariant measures might not even be in the Wasserstein 
space. So our result improves this sufficiently. 

Before we show how to use Conley theory for small Markov-type noise applied to 
/ we give a sufficient condition such that bounded sets in V\ (X) are {F n }-admissible 
for F n ->■ /*. 

Proposition 11. Let B w be closed and bounded and U be a 5 -neighborhood of B w 
with /*- admissible closure. Suppose F n — > uniformly on some U, i.e. 

supu!i(/»(jU),F„(^)) = e„ -)• 0. 

If f* * s uniformly continuous in U then B w is {F n }- admissible. 

Remark. (1) The idea is to use the uniform convergence and uniform continuity 
to construct longer and longer orbits of /* close the the last part of the orbits of 
F n , i.e. the orbit f^ 0,m "~ k ™}(y n ) and F]i n,mn \x n ) should be closer and closer and 
m n - k n -> oo for y n = F% n (x n ). 

(2) Uniform continuity of /* and uniform convergence of F n — > /» are the assump- 
tions Benci |Ben91| used to prove his continuation theorem for the Conley index. 
Besides having continuous time dynamical systems he also needs invertibility. 

Proof. Let \i n G N and m n — > oo be sequences with Fn' mTl \^ n ). Uniform conver- 
gence of F n — > /» and uniform continuity of / imply that for some e(e„) — > as 
e n -> 

< e„ + e(e n ) =: e n ,2 ^ as n -> oo. 
Similarly we can show that there are — > as n — > oo such that 

w^F^Ji^)) <e„, fc ->0. 

Therefore, there is a sequence k n > with m n — k n — > oo such that i^°' m "l (/x n ) c 
B w implies that 

fl ' mn - kn] ("n)cU = U s (B w ) 
for v n — F% n (ii n ). Furthermore, we can choose k n such that 

S n = max e„ tk -> 
ke[0,m n — k n ] 

and therefore 

«; 1 (F„ m "(/x„),/r"- fcn K))<5n- 

Because the closure of U is /^-admissible, the sequence of endpoints {f™ n ~ kn (Vn)}neN C 
U has a convergent subsequence which implies that {F™ n (^, n )} n ^ has a convergent 
subsequence, in particular the limit point is in B w . □ 
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This proposition applies in particular to Lipschitz continuous functions / : X — > 
X because the induce map /* : V\{X) — > Vi{X) is Lipschitz continuous as well. 

E.g. suppose X = W 1 and / is the time h map of a flow generated by an ODE 
x = g(x) such that g satisfies the one-sided Lipschitz condition for some Met 

(x-y,g(x) -g(y)) < Af||a;-t/|| 2 

then / is Lipschitz continuous with constant e Mh . 



3. Markov-Wasserstein maps 

Definition 12. A Markov-Wasserstein map (MW-map) of order p is a continuous 
map P : P p (X) — > V P (X) which is convex linear, i.e. for fi, v S V P (X) and a S [0, 1] 

P{a^ + (I - a)v) = aP{^) + (f - a)P{v). 

Suppose P is induced by a kernel p{dy\x) : X — > V P (X) (necessarily continuous), 



P '■ dfi(y) i-> J p(dy\x)dfi(x). 



The map P — P¥ is called an MW-map of order p relative to / with noise level 
(at most) M if 

sup w p {p{dy\x),8 f(x) ) < M. 

x£X 

This implies by |Vil09| 4.8] 



w p (P(»),f*(ti)r < J w p (p(d y \x),S f{x) rd^x) < PP. 

Remark. As in the sections before, the results also hold for semiflows and a suitable 
definition for MW-semiflows, i.e. a continuous semigroups (Pt)t>o on Pp(X). The 
noise level model can be stated similarly, but we only require that it is uniformly 
small for all "small" t. We will focus here only on maps, resp. MW-maps, because 
the intuition behind these is easier, but all results also hold for MW-semiflows if 
the noise level is sufficiently small. 

MW-maps (resp. MW-semiflows) appear naturally in the theory of Markov 
chains (resp. processes). The map p(dy\-) : X —> V P (X) is the Markov transition 
probability function, whereas the map P : V P (X) — > V P (X) almost never has a 
name. If P = /* for some dynamical system / : X — > X then P is sometimes called 
transfer map. The following will show that we only need continuity of p{dy\-) to 
ensure that P is continuous in V P (X) (and thus for any V q (X), I < q < p and for 
V(X)). 

Theorem 13. Let p(dy\-) : X — > P p {X) be a Markov kernel, i.e. a measure-valued 
map, continuously depending on x. If M — swp x£X w p (p(dy\x),6 x ) < oo then P 
defined by 

P : dfi(y) i ^ J p(dy\x)dfj,(x) 

is an MW-map of order p relative to id : X — > X with noise level M. 

Remark. For MW-semiflows weak continuity of p t (dy\-) corresponds to Feller con- 
tinuity of the corresponding stochastic process. In fact, if the initial distribution of 
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(X™)t>a is 5% n , i.e. Xq = x n , then by our continuity requirement if x n —> xo then 
/j™ = PtS Xi — > Pt.5 Xa = $ which implies that 

u(x n ) = Eg(X?) = / g{x)dtf{x) / g(x)dtf(x) = Eg(X°) = u(x ) 



for all bounded continuous function g(x), i.e. the stochastic process generated 
by (Pt)t>o is Feller continuous. The continuity of the moments implies that, in 
addition, the moments are also continuous. This condition could be called p-Feller 
continuous. This type of continuity does not need t 6 1 and thus applies equally 
to Markov chains, i.e. discrete time stochastic processes. 

Proof. Continuous dependency implies that P : V P (X) — > V(X) is continuous. So 
we only need to show that ji n — > p in V P (X) implies that Pp n — > Pp in V P (X) 
Because w p (p(dy\x), S x ) < M for some M < oo we have 

d{y,x ) p p(dy\x) = w p {p{dy\x),S Xo ) p 

< (w p (p(dy\x),5 x ) +w p (S x ,S Xo )) p 

< 2 p ~ 1 (M p + d(x,x ) p ) 

This implies that g(x) = J d{y , xq) p p{dy\x) grows at most like d(x,xo) p . Fur- 
thermore, g is continuous because x H> p{dy\x) and [i h-> w p (/j,,5 Xo ) p are. Thus by 
[WOO! 6.8] 

d(y,x ) p dPp n (x) = J g{x)dp n {x) -> J g(x)dfi(x) = J d(y,x ) p dPfi(y), 

i.e. Pn n -> Pfi in V P {X). □ 
Example. (1) Bounded noise can be modeled by Markov maps with 

M = sup d(x, suppp(c?y|x)) < oo. 

xex 

Then w p {p{dy\x), S x ) < M and thus continuity of p(dy\-) : X — > P p {X) for some p 
implies that of P : V P (X) — > V P (X). This could also be used to model multi-valued 
perturbations, i.e. maps /„ : X — > 2 X with sup xg ^ d(f(x), f n {x)) < M. 

(2) Let X = R" with its Euclidean distance. If v is the standard normal dis- 
tribution then v — p(x)dx, where dx is the Lebesgue measure on R™. Any normal 
distribution with mean x and variance a 2 can be modeled as follows 

AVer 2 — 5 X * p a 

where p e (x) = (T n p(xj t). For er = we set p, Xt a — 5 X . 

since m p = J \x\ p p{x)dx < oo for all p this implies (see |AGS08| 7.1.10]) that 

w p (S x ,/j, Xt(T 2) < am p . 

Thus Gaussian noise with uniformly small variance is uniformly small in all Wasser- 
stein spaces (although the noise level diverges to oo as p — ¥ oo). 

Corollary 14. /// : X — >• X induces a continuous self map onV p (X) and p(dy\-) : 
X — > V P (X) is continuous with 

M= sup Wp(p(dy\x),8 f ( x )) < oo 
xex 

then P : V P {X) — > V P (X) defined as above is an MW map of order p relative to f 
with noise level M. 
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A random perturbation can now be modeled as a composition of / followed by 
a smearing via p(dy\-), i.e. /* followed by P = P^f . This corresponds to additive 
noise depending only on the image, whereas a general MW-map relative to / might 
smear the image f(x) and f(y) for x ^ y differently even if fix) — f(y). 

If for some sequence p n (dy\-) the noise level 

sup Wp(p n (dy\x),Sfi x )) = e n 
xex 

converges to zero then P n converges to /* uniformly on Vp{X), i.e. 

Sup Wp(Pn(fl),f*(fJL)) -> 0. 
^V P (X) 

An MW-chain relative to / satisfies the Markov property, i.e. future behavior 
only depends on the current state. Furthermore, this models only time-independent 
random perturbations. Time-dependent perturbations can be modeled with the 
result of [Kelllj . There it is shown that a local attractor can be continued if the 
non-autonomous perturbations is uniformly small. Translated into this framework 
this means 

sup w p (P((i,k),f*(ji)) < e 

ij.ev p (x),kez 

for the non-autonomous dynamical system (w inhomogeneous Markov map) 

-> (P( M ,fc),fc + l). 

Instead of using the semi-admissibility argument to show that the invariant set K n 
is non-empty we can use a weak compactness argument to get the same result. 

Example. Consider the ODE with x — x — x 3 . This satisfies the one-side Lipschitz 
condition with M = 1 and generates a global semiflow that attracts in finite time. 
Thus the time-one map induces a Lipschitz continuous map /* on V P {X) which 
attracts in finite time, too. Therefore, any bounded closed set in V P (X) is {F n }- 
admissible for F n — > /» uniformly. In particular, the MW-map of order p for small 
noise level has an attractor close to the original w.r.t. the Wasserstein metric w p . 

Theorem 15. Suppose f : X — > X induces a dynamical system /» on V P (X) 
having a global attractor and that /* is uniformly continuous in a neighborhood of 
the global attractor. If P n — > /* is a sequence of MW-maps of order p relative to f 
with noise level e n 0. Then for n > no there is a positive P n -invariant isolating 
neighborhood N n such that K n = Ap n (N n ) is non-empty and a weakly compact weak 
attractor which contains all bounded P n -invariant measures, i.e. Ap n (P p (X)) = 
K n . Furthermore, there is at least one stationary measure in K n . 

Remark. Suppose p > 1. Whenever K is (strongly) compact in V P {X) then 
c\U${K) is weakly compact w.r.t. the weaker subspace topology of V P (X) induced 
by V P (X) C V q {X) for any 1 < q < p. Thus for all /i € N n there is a /j,k € K n 

Pn(fJ-) — ^ (J>K- 

This means that, although the p-moment may not converge, any g-moment con- 
verges for 1 < q < p, but the convergence may get worse the closer q comes to 
P- 
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Proof. Everything but the existence of a stationary measure and Ap n (V P (X)) = K n 
follows from theorem [5] Since P n is convex linear K n must be convex. This implies 
that for any fi £ K n the sequence 

( k=0 J m£ N 

is in K n and thus weakly converging to some v £ K n and by the Krylov-Bogolyubov 
theorem it must be a fixed point of P n , i.e. v is a stationary measure of P n . 
Furthermore, if R is the distance from the global attractor of / then its mass must 
decay as R~ p . 

The invariant measures must all be contained in the interior of N n . Otherwise 
take fi £ Ap n (V p (X))\K n . If fj, is P„-stationary then the argument is as follows: 
For t £ [0, 1] and some /iq G K n the graph of t h-> t/j, + (1 — t)[io is stationary and 
intersects dN n , which implies K n — Ap n (N n ) intersects dN n . This contradicts the 
isolatedness of K n . 

For the general case assume a : Z — > P p (X) is a bounded full solution through 
fi, i.e. P n (a(k)) — a(k + 1) and <r(0) = fi. Now define the function g : Z — > [0, 1] 

g : sup{t £ [0, 1] | sa(k) + (1 - s)fi £ N' n for alls £ [0,t]}. 

Because N n is a positive invariant neighborhood of K ni fi$ stationary, P n convex 
linear and {o~(k)}k£z bounded and not entirely in N n we have 

0<5<g(k) <g(k+l). 

This implies that 

T = inf g(k) > S. 

fe<0 

Because <r(0) ^ N n we have T < 1 and thus by definition of g 

w p (Ta(k) + (1 -T)fj, ,dN' n ) -> as/c^-oo 

and thus there is a Tj < T such that o-(fc) = T\a{k) + (1 — Ti)/io is a full solution 
in iV n with 

a<) < I 

for some fci < 0. But a(k\) £ K n and U e (K n ) C A^ n which implies that 

e<«/ p (cr(fci),0<) < |. 

This is a contradiction and thus Ap n {T p (X))\K n = 0, i.e. K n contains all bounded 
invariant measures in V P (X). □ 

Corollary 16. The positive /„- invariant isolating neighborhood B and the positive 
P n -invariant isolating neighborhood N n can be chosen convex, i.e. if fa £ B (resp. 
fii £ N n ) for i = 0, 1 then fi t £ B (resp. fi t £ N n ) for /it = tfiQ + (1 — t)fj,i and 
tG [0,1]. 

Proof. Let Vi, fa £ V P (X) for i = 0, 1 and define /i t = t/ii + (1 — t)fiQ and i/< = 
ii^X + (1 — t)vo. Assume 
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Then there are optimal transference plans iti £ n(/ii,z/) such that 

J d(x,y) p dwi(x,y) < e p . 
The plan w t = tiri + (1 — t)iro is a transference plan for the pair (/i t , v t ) and thus 
w p (fit,^t) p < J d(x, y) p dn t (x,y) 

d{x, y) p dni(x,y) + (1 -t) J d(x,y) p dn (x,y) 



< t 

< te p + (1 - t)e p = e p 



The construction of B is done via a Lyapunov pair (</>, 7) essentially measuring a 
weighted distance of the forward orbit of a point, i.e. 

F N , : fi h-> mm{l, w p (fi, A- (N') U dN')} 

and 

4, : n sup{(2n + l)F N ,(f?(x))/(n + 1) | n G N, n < w^(x)} 

Let P| be defined as in the proof of theorem [TJ We can assume that w p {P{, dN 1 ) > 
2e for some for sufficiently small e > 0. Because A~(N') = A(N') is convex, 
(j>(fii) < e for i = 0, 1 implies 

4>(vt) < e - 

Hence P{ is convex and we can choose B = Pf for some small 5 > 0. Similarly 
V(a) defined in theorem [5] is convex. 
The set N n was defined as 

N n (e) = Nn c\{y 3x £ V(e), m > s.t. P,t 0,ml (a;) C L>and P™(x) = y} 

where U = ixitB and N = clV(eo) are convex sets. Hence N n (e) is convex . □ 

Corollary 17. Under the assumption of the previous theorem if p — 1 then all 
orbits of P n are bounded for n > uq, i.e. Pn'°°\fj,) C S^(/io) for some R>0 and 
some fixed fig. In particular, K n is the global weak attractor of P n . 

Remark. The idea of the proof is to control the distance of fix and fig by the distance 
of fi t and Hi where fig will be some stationary measure and t £ (0, 1] is sufficiently 
small. 

Proof. Using the Kantorovich-Rubinstein formula we have the following equality 
for n,v€ Vi{X) 



Wi(fi,v) = inf / d(x,y)di:(x,y) = sup < / <j>dfi — I <j)dv 
7ren(/i,i/j J ||0||Li P <i U J 

i.e. there is a sequence <f>h with ||0fc||Lip < 1 such that J tfikdp, — J 4>kdv /* w\(fi,v). 
Furthermore, there exist an optimal plan n £ n(/ii, v) such that the infimum is 
actually attained. 

Choose fig e Vi(X) and define fi t — tfi\ + (1 — t)fiQ for t 6 [0, 1] and fi\ £ Vi(X). 
We claim 

wi(fit,fj,o) = twi(m,(io). 
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Suppose 7r 6 II(/ii,/io) is the optimal plan and 4>k the sequence of Lipschitz maps 
as above. Then tt = tn + (1 — t)(id, id)*^o is in Yl(/j,t, fio)- Thus 

«Ji(//t,Mo) < j d(x,y)dn(x,y) = tioi(/ii,/x )- 
Furthermore, we have 

w\{ih,hq) > j (fikdfit - J <f>kdno = t (^J fikdfi! - J fadno 

Because the left hand side converges monotonically to tw\{n t -, Mo) we have proved 
our claim. 

Now fix some stationary measure fi a e if„. Since 7V„ is a neighborhood of if„ 
there is a i G (0, 1] for all /ii 6 X such that ^t t as defined above is in N n . Thus 

Pn'°°\fit) is in N n and bounded, i.e. wi(P^(fx t ), Mo) < ^ f° r some i?. Because P„ 
is convex linear and /i stationary we have Pnint) — tP^{fjb\) + (1 — i)/zo and hence 

1 

«;i(P*(/ii),/io) - -w 1 {P^{fx t ), ( i )<j, 
which implies that the orbit of /ii is bounded. □ 
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